Value function gradient learning for large-scale multistage stochastic programming problems
نویسندگان
چکیده
• A stagewise decomposition algorithm for large-scale stochastic program is proposed. The approximates value functions by predetermined parametric forms. fits the gradients of function to those function. Numerical examples show computational efficiency proposed algorithm. called “value gradient learning” (VFGL) multistage convex programs. VFGL finds parameter values that best fit within a given family. Widely used algorithms programming, such as dual dynamic programming (SDDP), approximate adding linear subgradient cuts at each iteration. Although this approach has been successful problems, nonlinear problems may suffer from increasing size subproblem iteration proceeds. On other hand, fixed number parameters; thus, subproblems remains constant throughout Furthermore, can learn parameters means descent, which it be easil0y parallelized and does not require scenario tree approximation underlying uncertainties. was compared with deterministic equivalent formulation problem SDDP approaches three illustrative examples: production planning, hydrothermal generation, lifetime financial planning problem. generates high-quality solutions computationally efficient.
منابع مشابه
Stochastic Gradient Twin Support Vector Machine for Large Scale Problems
Stochastic gradient descent algorithm has been successfully applied on support vector machines (called PEGASOS) for many classification problems. In this paper, stochastic gradient descent algorithm is investigated to twin support vector machines for classification. Compared with PEGASOS, the proposed stochastic gradient twin support vector machines (SGTSVM) is insensitive on stochastic samplin...
متن کاملStatistical inference of multistage stochastic programming problems
We discuss in this paper statistical inference of sample average approximations of multistage stochastic programming problems. We show that any random sampling scheme provides a valid statistical lower bound for the optimal value of the true problem. However, in order for such lower bound to be consistent one needs to employ the conditional sampling procedure. We also indicate that fixing a fea...
متن کاملAn efficient algorithm for large scale stochastic nonlinear programming problems
The class of stochastic nonlinear programming (SNLP) problems is important in optimization due to the presence of nonlinearity and uncertainty in many applications, including those in the field of process systems engineering. But despite the apparent importance of such problems, the solution algorithms for these problems have found few applications due to the severe computational and structural...
متن کاملInference of statistical bounds for multistage stochastic programming problems
We discuss in this paper statistical inference of sample average approximations of multistage stochastic programming problems. We show that any random sampling scheme provides a valid statistical lower bound for the optimal (minimum) value of the true problem. However, in order for such lower bound to be consistent one needs to employ the conditional sampling procedure. We also indicate that fi...
متن کاملMachine Learning Solution Methods for Multistage Stochastic Programming
This thesis investigates the following question: Can supervised learning techniques be successfully used for finding better solutions to multistage stochastic programs? A similar question had already been posed in the context of reinforcement learning, and had led to algorithmic and conceptual advances in the field of approximate value function methods over the years (Lagoudakis and Parr, 2003;...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: European Journal of Operational Research
سال: 2023
ISSN: ['1872-6860', '0377-2217']
DOI: https://doi.org/10.1016/j.ejor.2022.10.011